Overview

Dataset statistics

Number of variables: 27
Number of observations: 296
Missing cells: 407
Missing cells (%): 5.1%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 75.3 KiB
Average record size in memory: 260.4 B
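
The percentages above can be re-derived from the raw counts. The following is a minimal sketch, assuming the missing-cell percentage is taken over all cells (variables × observations) and the average record size is total memory divided by the number of observations. The variable names are illustrative and not part of the report.

```python
# Figures copied from the dataset statistics above.
n_variables = 27
n_observations = 296
missing_cells = 407
total_memory_kib = 75.3  # rounded value as displayed in the report

# Assumption: the percentage is computed over every cell in the table.
total_cells = n_variables * n_observations
missing_pct = 100 * missing_cells / total_cells

# The report presumably used the unrounded byte count, so recomputing from
# the rounded 75.3 KiB only reproduces the 260.4 B figure approximately.
avg_record_bytes = total_memory_kib * 1024 / n_observations

print(f"total cells: {total_cells}")
print(f"missing cells: {missing_pct:.1f}%")
print(f"average record size: {avg_record_bytes:.1f} B")
```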

Variable types

NUM: 15
BOOL: 8
CAT: 3
DATE: 1

Reproduction

Analysis started: 2020-05-05 17:13:56.645330
Analysis finished: 2020-05-05 17:14:31.800674
Version: pandas-profiling v2.5.0
Command line: pandas_profiling --config_file config.yaml [YOUR_FILE.csv]
Download configuration: config.yaml

Warnings

month is highly correlated with quarter and 1 other field (High Correlation)
quarter is highly correlated with month and 1 other field (High Correlation)
weekofyear is highly correlated with quarter and 1 other field (High Correlation)
meanwd_udsprevisionempresa is highly correlated with meanwd_udsventa (High Correlation)
meanwd_udsventa is highly correlated with meanwd_udsprevisionempresa (High Correlation)
udsstock has 97 (32.8%) missing values (Missing)
udsventa has 61 (20.6%) missing values (Missing)
udsprevisionempresa has 79 (26.7%) missing values (Missing)
roll4wd_udsventa has 50 (16.9%) missing values (Missing)
meanwd_udsventa has 42 (14.2%) missing values (Missing)
roll4wd_udsstock has 18 (6.1%) missing values (Missing)
roll4wd_udsprevisionempresa has 60 (20.3%) missing values (Missing)
weekday has 42 (14.2%) zeros (Zeros)
sin_weekday has 42 (14.2%) zeros (Zeros)
roll4wd_udsprevisionempresa has 5 (1.7%) zeros (Zeros)

Variables

df_index
Real number (ℝ≥0)

UNIFORM
UNIQUE
Distinct count296
Unique (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean10641.0
Minimum21
Maximum21261
Zeros0
Zeros (%)0.0%
Memory size2.4 KiB

Quantile statistics

Minimum21
5-th percentile1083
Q15331
median10641
Q315951
95-th percentile20199
Maximum21261
Range21240
Interquartile range (IQR)10620

Descriptive statistics

Standard deviation6162.628011
Coefficient of variation (CV)0.5791399315
Kurtosis-1.2
Mean10641
Median Absolute Deviation (MAD)5328
Skewness0
Sum3149736
Variance37977984
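
The descriptive statistics for df_index are related by simple identities (variance is the squared standard deviation, CV is standard deviation over mean, IQR is Q3 minus Q1). The sketch below re-derives them under one loudly stated assumption: that df_index is the arithmetic sequence 21, 93, ..., 21261 with step 72, which is consistent with the extreme values listed for this variable but is not stated by the report itself.

```python
import statistics

# Assumption: df_index is 21, 93, ..., 21261 (step 72, 296 values). This is
# inferred from the listed minimum/maximum values, not given by the report.
values = [21 + 72 * i for i in range(296)]

mean = statistics.mean(values)
variance = statistics.variance(values)  # sample variance (n - 1 denominator)
std = statistics.stdev(values)          # sample standard deviation
cv = std / mean                         # coefficient of variation
value_range = max(values) - min(values)

# Quartiles by linear interpolation between order statistics; "inclusive"
# is the stdlib method closest to the usual pandas/NumPy default.
q1, median, q3 = statistics.quantiles(values, n=4, method="inclusive")
iqr = q3 - q1

print("mean:", mean)
print("variance:", variance)
print("std:", round(std, 6))
print("CV:", round(cv, 10))
print("range:", value_range)
print("Q1, median, Q3, IQR:", q1, median, q3, iqr)
```

Under that assumption the recomputed figures agree with the values reported for this variable, which suggests the report uses sample (n - 1) variance and interpolated quantiles.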
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[2.1000e+01 2.1261e+04], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
1533 1 0.3%
 
18453 1 0.3%
 
2757 1 0.3%
 
10893 1 0.3%
 
8805 1 0.3%
 
11901 1 0.3%
 
2253 1 0.3%
 
9957 1 0.3%
 
19605 1 0.3%
 
8301 1 0.3%
 
Other values (286) 286 96.6%
 
ValueCountFrequency (%) 
21 1 0.3%
 
93 1 0.3%
 
165 1 0.3%
 
237 1 0.3%
 
309 1 0.3%
 
ValueCountFrequency (%) 
21261 1 0.3%
 
21189 1 0.3%
 
21117 1 0.3%
 
21045 1 0.3%
 
20973 1 0.3%
 

fecha
Date

UNIFORM
UNIQUE
Distinct count296
Unique (%)100.0%
Missing0
Missing (%)0.0%
Memory size2.4 KiB
Minimum2019-06-05 00:00:00
Maximum2020-03-26 00:00:00
Histogram

producto
Categorical

CONSTANT
REJECTED
Distinct count1
Unique (%)0.3%
Missing0
Missing (%)0.0%
Memory size2.4 KiB
ValueCountFrequency (%) 
30 296 100.0%
 

Length

Max length2
Mean length2
Min length2
ValueCountFrequency (%) 
Decimal_Number 2 100.0%
 
ValueCountFrequency (%) 
Common 2 100.0%
 
ValueCountFrequency (%) 
ASCII 2 100.0%
 

udsstock
Real number (ℝ≥0)

MISSING
Distinct count108
Unique (%)54.3%
Missing97
Missing (%)32.8%
Infinite0
Infinite (%)0.0%
Mean1130.286432160804
Minimum39.0
Maximum2275.0
Zeros0
Zeros (%)0.0%
Memory size2.4 KiB

Quantile statistics

Minimum39
5-th percentile488.4
Q1840
median1111
Q31409
95-th percentile1848.3
Maximum2275
Range2236
Interquartile range (IQR)569

Descriptive statistics

Standard deviation412.5649791
Coefficient of variation (CV)0.365009229
Kurtosis0.197372151
Mean1130.286432
Median Absolute Deviation (MAD)328.6583167
Skewness0.03699693094
Sum224927
Variance170209.862
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
1046 7 2.4%
 
982 6 2.0%
 
1201 6 2.0%
 
762 6 2.0%
 
749 5 1.7%
 
1447 5 1.7%
 
1356 4 1.4%
 
891 4 1.4%
 
1020 4 1.4%
 
1343 4 1.4%
 
Other values (98) 148 50.0%
 
(Missing) 97 32.8%
 
ValueCountFrequency (%) 
39 1 0.3%
 
64 1 0.3%
 
129 1 0.3%
 
211 1 0.3%
 
232 1 0.3%
 
ValueCountFrequency (%) 
2275 1 0.3%
 
2261 1 0.3%
 
2157 1 0.3%
 
1989 1 0.3%
 
1951 1 0.3%
 

udsventa
Real number (ℝ≥0)

MISSING
Distinct count80
Unique (%)34.0%
Missing61
Missing (%)20.6%
Infinite0
Infinite (%)0.0%
Mean585.795744680851
Minimum137.0
Maximum1938.0
Zeros0
Zeros (%)0.0%
Memory size2.4 KiB

Quantile statistics

Minimum137
5-th percentile262
Q1432
median551
Q3698
95-th percentile957
Maximum1938
Range1801
Interquartile range (IQR)266

Descriptive statistics

Standard deviation254.7286657
Coefficient of variation (CV)0.4348421238
Kurtosis6.080875428
Mean585.7957447
Median Absolute Deviation (MAD)181.3127026
Skewness1.780464488
Sum137662
Variance64886.69314
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
511 9 3.0%
 
600 7 2.4%
 
492 7 2.4%
 
590 7 2.4%
 
442 6 2.0%
 
314 6 2.0%
 
649 6 2.0%
 
354 6 2.0%
 
432 6 2.0%
 
698 6 2.0%
 
Other values (70) 169 57.1%
 
(Missing) 61 20.6%
 
ValueCountFrequency (%) 
137 1 0.3%
 
147 1 0.3%
 
157 1 0.3%
 
216 1 0.3%
 
246 4 1.4%
 
ValueCountFrequency (%) 
1938 1 0.3%
 
1741 1 0.3%
 
1603 2 0.7%
 
1407 1 0.3%
 
1170 1 0.3%
 

udsprevisionempresa
Real number (ℝ≥0)

MISSING
Distinct count203
Unique (%)93.5%
Missing79
Missing (%)26.7%
Infinite0
Infinite (%)0.0%
Mean2845.2350230414745
Minimum0.0
Maximum17426.0
Zeros2
Zeros (%)0.7%
Memory size2.4 KiB

Quantile statistics

Minimum0
5-th percentile337.8
Q11172
median2322
Q33609
95-th percentile7185.4
Maximum17426
Range17426
Interquartile range (IQR)2437

Descriptive statistics

Standard deviation2424.726318
Coefficient of variation (CV)0.852205986
Kurtosis7.871113381
Mean2845.235023
Median Absolute Deviation (MAD)1700.164497
Skewness2.210779419
Sum617416
Variance5879297.718
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
2322 2 0.7%
 
289 2 0.7%
 
1050 2 0.7%
 
123 2 0.7%
 
2987 2 0.7%
 
2084 2 0.7%
 
1613 2 0.7%
 
712 2 0.7%
 
2078 2 0.7%
 
3589 2 0.7%
 
Other values (193) 197 66.6%
 
(Missing) 79 26.7%
 
ValueCountFrequency (%) 
0 2 0.7%
 
62 1 0.3%
 
98 1 0.3%
 
123 2 0.7%
 
169 1 0.3%
 
ValueCountFrequency (%) 
17426 1 0.3%
 
13665 1 0.3%
 
11310 1 0.3%
 
10044 1 0.3%
 
10028 1 0.3%
 

promo
Boolean

CONSTANT
REJECTED
Distinct count1
Unique (%)0.3%
Missing0
Missing (%)0.0%
Memory size2.4 KiB
ValueCountFrequency (%) 
0 296 100.0%
 

festivo
Boolean

Distinct count2
Unique (%)0.7%
Missing0
Missing (%)0.0%
Memory size2.4 KiB
ValueCountFrequency (%) 
0 288 97.3%
 
1 8 2.7%
 

weekday
Real number (ℝ≥0)

ZEROS
Distinct count7
Unique (%)2.4%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean2.9966216216216215
Minimum0
Maximum6
Zeros42
Zeros (%)14.2%
Memory size2.4 KiB

Quantile statistics

Minimum0
5-th percentile0
Q11
median3
Q35
95-th percentile6
Maximum6
Range6
Interquartile range (IQR)4

Descriptive statistics

Standard deviation1.997453142
Coefficient of variation (CV)0.6665683542
Kurtosis-1.241520413
Mean2.996621622
Median Absolute Deviation (MAD)1.706560446
Skewness0.004680305814
Sum887
Variance3.989819056
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[0. 0.5 5.5 6. ], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
3 43 14.5%
 
2 43 14.5%
 
6 42 14.2%
 
5 42 14.2%
 
4 42 14.2%
 
1 42 14.2%
 
0 42 14.2%
 
ValueCountFrequency (%) 
0 42 14.2%
 
1 42 14.2%
 
2 43 14.5%
 
3 43 14.5%
 
4 42 14.2%
 
ValueCountFrequency (%) 
6 42 14.2%
 
5 42 14.2%
 
4 42 14.2%
 
3 43 14.5%
 
2 43 14.5%
 

quarter
Categorical

HIGH CORRELATION
Distinct count4
Unique (%)1.4%
Missing0
Missing (%)0.0%
Memory size2.4 KiB
ValueCountFrequency (%) 
4 92 31.1%
 
3 92 31.1%
 
1 86 29.1%
 
2 26 8.8%
 

Length

Max length1
Mean length1
Min length1
ValueCountFrequency (%) 
Decimal_Number 4 100.0%
 
ValueCountFrequency (%) 
Common 4 100.0%
 
ValueCountFrequency (%) 
ASCII 4 100.0%
 

month
Real number (ℝ≥0)

HIGH CORRELATION
Distinct count10
Unique (%)3.4%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean6.993243243243243
Minimum1
Maximum12
Zeros0
Zeros (%)0.0%
Memory size2.4 KiB

Quantile statistics

Minimum1
5-th percentile1
Q13
median8
Q310
95-th percentile12
Maximum12
Range11
Interquartile range (IQR)7

Descriptive statistics

Standard deviation3.667533456
Coefficient of variation (CV)0.5244395666
Kurtosis-1.215710455
Mean6.993243243
Median Absolute Deviation (MAD)3.109751644
Skewness-0.3478227975
Sum2070
Variance13.45080165
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[ 1. 1.5 2.5 6.5 11.5 12. ], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
12 31 10.5%
 
10 31 10.5%
 
8 31 10.5%
 
7 31 10.5%
 
1 31 10.5%
 
11 30 10.1%
 
9 30 10.1%
 
2 29 9.8%
 
6 26 8.8%
 
3 26 8.8%
 
ValueCountFrequency (%) 
1 31 10.5%
 
2 29 9.8%
 
3 26 8.8%
 
6 26 8.8%
 
7 31 10.5%
 
ValueCountFrequency (%) 
12 31 10.5%
 
11 30 10.1%
 
10 31 10.5%
 
9 30 10.1%
 
8 31 10.5%
 

weekofyear
Real number (ℝ≥0)

HIGH CORRELATION
Distinct count43
Unique (%)14.5%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean28.469594594594593
Minimum1
Maximum52
Zeros0
Zeros (%)0.0%
Memory size2.4 KiB

Quantile statistics

Minimum1
5-th percentile3
Q111
median31
Q342
95-th percentile50
Maximum52
Range51
Interquartile range (IQR)31

Descriptive statistics

Standard deviation15.97664889
Coefficient of variation (CV)0.561182873
Kurtosis-1.229228509
Mean28.46959459
Median Absolute Deviation (MAD)13.65613587
Skewness-0.3266565044
Sum8427
Variance255.2533097
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[ 1. 12.5 23.5 52. ], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
52 7 2.4%
 
51 7 2.4%
 
29 7 2.4%
 
28 7 2.4%
 
27 7 2.4%
 
26 7 2.4%
 
25 7 2.4%
 
24 7 2.4%
 
12 7 2.4%
 
11 7 2.4%
 
Other values (33) 226 76.4%
 
ValueCountFrequency (%) 
1 7 2.4%
 
2 7 2.4%
 
3 7 2.4%
 
4 7 2.4%
 
5 7 2.4%
 
ValueCountFrequency (%) 
52 7 2.4%
 
51 7 2.4%
 
50 7 2.4%
 
49 7 2.4%
 
48 7 2.4%
 

working_day
Boolean

Distinct count2
Unique (%)0.7%
Missing0
Missing (%)0.0%
Memory size424.0 B
ValueCountFrequency (%) 
True 246 83.1%
 
False 50 16.9%
 

sin_weekday
Real number (ℝ)

ZEROS
Distinct count7
Unique (%)2.4%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean0.004759498821957385
Minimum-0.9749279121818236
Maximum0.9749279121818236
Zeros42
Zeros (%)14.2%
Memory size2.4 KiB

Quantile statistics

Minimum-0.9749279122
5-th percentile-0.9749279122
Q1-0.7818314825
median0
Q30.7818314825
95-th percentile0.9749279122
Maximum0.9749279122
Range1.949855824
Interquartile range (IQR)1.563662965

Descriptive statistics

Standard deviation0.7086201304
Coefficient of variation (CV)148.8854514
Kurtosis-1.50521649
Mean0.004759498822
Median Absolute Deviation (MAD)0.6270716718
Skewness-0.0106157593
Sum1.408811651
Variance0.5021424891
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[-0.97492791 -0.8783797 0.8783797 0.97492791], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
0.4338837391 43 14.5%
 
0.9749279122 43 14.5%
 
-0.4338837391 42 14.2%
 
-0.9749279122 42 14.2%
 
-0.7818314825 42 14.2%
 
0.7818314825 42 14.2%
 
0 42 14.2%
 
ValueCountFrequency (%) 
-0.9749279122 42 14.2%
 
-0.7818314825 42 14.2%
 
-0.4338837391 42 14.2%
 
0 42 14.2%
 
0.4338837391 43 14.5%
 
ValueCountFrequency (%) 
0.9749279122 43 14.5%
 
0.7818314825 42 14.2%
 
0.4338837391 43 14.5%
 
0 42 14.2%
 
-0.4338837391 42 14.2%
 

cos_weekday
Real number (ℝ)

Distinct count7
Unique (%)2.4%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean-0.0037955736549281846
Minimum-0.9009688679024191
Maximum1.0
Zeros0
Zeros (%)0.0%
Memory size2.4 KiB

Quantile statistics

Minimum-0.9009688679
5-th percentile-0.9009688679
Q1-0.9009688679
median-0.222520934
Q30.6234898019
95-th percentile1
Maximum1
Range1.900968868
Interquartile range (IQR)1.52445867

Descriptive statistics

Standard deviation0.7079619739
Coefficient of variation (CV)-186.5230498
Kurtosis-1.503349059
Mean-0.003795573655
Median Absolute Deviation (MAD)0.6408877408
Skewness0.009053080122
Sum-1.123489802
Variance0.5012101565
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[-0.90096887 -0.90096887 1. ], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
-0.222520934 43 14.5%
 
-0.9009688679 43 14.5%
 
-0.222520934 42 14.2%
 
-0.9009688679 42 14.2%
 
0.6234898019 42 14.2%
 
1 42 14.2%
 
0.6234898019 42 14.2%
 
ValueCountFrequency (%) 
-0.9009688679 42 14.2%
 
-0.9009688679 43 14.5%
 
-0.222520934 42 14.2%
 
-0.222520934 43 14.5%
 
0.6234898019 42 14.2%
 
ValueCountFrequency (%) 
1 42 14.2%
 
0.6234898019 42 14.2%
 
0.6234898019 42 14.2%
 
-0.222520934 43 14.5%
 
-0.222520934 42 14.2%
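
The sin_weekday and cos_weekday values listed above are consistent with a cyclical encoding of weekday over a period of 7. The report does not state how these columns were built, so the sketch below is an assumption: it maps weekday w to (sin(2πw/7), cos(2πw/7)), and encode_weekday is a hypothetical helper name.

```python
import math

def encode_weekday(weekday: int) -> tuple[float, float]:
    """Map weekday 0..6 onto the unit circle (assumed encoding, period 7)."""
    angle = 2 * math.pi * weekday / 7
    return math.sin(angle), math.cos(angle)

# Print the encoded pair for each weekday, rounded as in the sample rows.
for day in range(7):
    sin_value, cos_value = encode_weekday(day)
    print(day, round(sin_value, 6), round(cos_value, 6))
```

Mathematically only four distinct cosine values exist for seven weekdays (for example, weekdays 2 and 5 share the same cosine). Pairs computed this way may still differ in their last floating-point bits, which would be one possible explanation for the distinct count of 7 and the repeated values in the tables above.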
 

is_august
Boolean

Distinct count2
Unique (%)0.7%
Missing0
Missing (%)0.0%
Memory size2.4 KiB
ValueCountFrequency (%) 
0 265 89.5%
 
1 31 10.5%
 

spring
Boolean

Distinct count2
Unique (%)0.7%
Missing0
Missing (%)0.0%
Memory size2.4 KiB
ValueCountFrequency (%) 
0 291 98.3%
 
1 5 1.7%
 

summer
Boolean

Distinct count2
Unique (%)0.7%
Missing0
Missing (%)0.0%
Memory size2.4 KiB
ValueCountFrequency (%) 
0 188 63.5%
 
1 108 36.5%
 

autumn
Boolean

Distinct count2
Unique (%)0.7%
Missing0
Missing (%)0.0%
Memory size2.4 KiB
ValueCountFrequency (%) 
0 206 69.6%
 
1 90 30.4%
 

winter
Boolean

Distinct count2
Unique (%)0.7%
Missing0
Missing (%)0.0%
Memory size2.4 KiB
ValueCountFrequency (%) 
0 200 67.6%
 
1 96 32.4%
 

stockMissingType
Categorical

Distinct count3
Unique (%)1.0%
Missing0
Missing (%)0.0%
Memory size2.4 KiB
ValueCountFrequency (%) 
0 199 67.2%
 
2 82 27.7%
 
1 15 5.1%
 

Length

Max length3
Mean length3
Min length3
ValueCountFrequency (%) 
Decimal_Number 3 75.0%
 
Other_Punctuation 1 25.0%
 
ValueCountFrequency (%) 
Common 4 100.0%
 
ValueCountFrequency (%) 
ASCII 4 100.0%
 

roll4wd_udsventa
Real number (ℝ≥0)

MISSING
Distinct count235
Unique (%)95.5%
Missing50
Missing (%)16.9%
Infinite0
Infinite (%)0.0%
Mean573.0759146341463
Minimum190.2
Maximum1293.25
Zeros0
Zeros (%)0.0%
Memory size2.4 KiB

Quantile statistics

Minimum190.2
5-th percentile301.3125
Q1460.4642857
median551.1875
Q3693.28125
95-th percentile875.75
Maximum1293.25
Range1103.05
Interquartile range (IQR)232.8169643

Descriptive statistics

Standard deviation172.1052204
Coefficient of variation (CV)0.3003183627
Kurtosis0.5630146335
Mean573.0759146
Median Absolute Deviation (MAD)137.5237316
Skewness0.4154764392
Sum140976.675
Variance29620.20689
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
518.5 3 1.0%
 
606 2 0.7%
 
709.125 2 0.7%
 
700.625 2 0.7%
 
680.75 2 0.7%
 
511 2 0.7%
 
479.25 2 0.7%
 
719.125 2 0.7%
 
539.5 2 0.7%
 
640.375 2 0.7%
 
Other values (225) 225 76.0%
 
(Missing) 50 16.9%
 
ValueCountFrequency (%) 
190.2 1 0.3%
 
207.4285714 1 0.3%
 
211.125 1 0.3%
 
228.25 1 0.3%
 
231.6 1 0.3%
 
ValueCountFrequency (%) 
1293.25 1 0.3%
 
983.375 1 0.3%
 
977.375 1 0.3%
 
944 1 0.3%
 
936.75 1 0.3%
 

meanwd_udsventa
Real number (ℝ≥0)

HIGH CORRELATION
MISSING
Distinct count6
Unique (%)2.4%
Missing42
Missing (%)14.2%
Infinite0
Infinite (%)0.0%
Mean585.5999817451901
Minimum397.2368421052632
Maximum811.8717948717949
Zeros0
Zeros (%)0.0%
Memory size2.4 KiB

Quantile statistics

Minimum397.2368421
5-th percentile397.2368421
Q1461.575
median568.155761
Q3704.375
95-th percentile811.8717949
Maximum811.8717949
Range414.6349528
Interquartile range (IQR)242.8

Descriptive statistics

Standard deviation139.513103
Coefficient of variation (CV)0.2382395959
Kurtosis-1.097016964
Mean585.5999817
Median Absolute Deviation (MAD)115.0452121
Skewness0.2891297061
Sum148742.3954
Variance19463.90591
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
704.375 43 14.5%
 
560.4736842 43 14.5%
 
575.8378378 42 14.2%
 
461.575 42 14.2%
 
397.2368421 42 14.2%
 
811.8717949 42 14.2%
 
(Missing) 42 14.2%
 
ValueCountFrequency (%) 
397.2368421 42 14.2%
 
461.575 42 14.2%
 
560.4736842 43 14.5%
 
575.8378378 42 14.2%
 
704.375 43 14.5%
 
ValueCountFrequency (%) 
811.8717949 42 14.2%
 
704.375 43 14.5%
 
575.8378378 42 14.2%
 
560.4736842 43 14.5%
 
461.575 42 14.2%
 

roll4wd_udsstock
Real number (ℝ≥0)

MISSING
Distinct count248
Unique (%)89.2%
Missing18
Missing (%)6.1%
Infinite0
Infinite (%)0.0%
Mean1158.1083119218913
Minimum234.0
Maximum2261.0
Zeros0
Zeros (%)0.0%
Memory size2.4 KiB

Quantile statistics

Minimum234
5-th percentile696.1571429
Q1892.55
median1135.071429
Q31385.0875
95-th percentile1740.7125
Maximum2261
Range2027
Interquartile range (IQR)492.5375

Descriptive statistics

Standard deviation352.7980329
Coefficient of variation (CV)0.3046330203
Kurtosis0.2689832869
Mean1158.108312
Median Absolute Deviation (MAD)283.7650596
Skewness0.3389982677
Sum321954.1107
Variance124466.452
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
1524 6 2.0%
 
1240 4 1.4%
 
1040 3 1.0%
 
930 3 1.0%
 
749 2 0.7%
 
1149.857143 2 0.7%
 
1130 2 0.7%
 
1628 2 0.7%
 
234 2 0.7%
 
1796 2 0.7%
 
Other values (238) 250 84.5%
 
(Missing) 18 6.1%
 
ValueCountFrequency (%) 
234 2 0.7%
 
240.8571429 1 0.3%
 
395.4 1 0.3%
 
465 1 0.3%
 
474 1 0.3%
 
ValueCountFrequency (%) 
2261 1 0.3%
 
2193 1 0.3%
 
2157 1 0.3%
 
2125 1 0.3%
 
2057 1 0.3%
 

meanwd_udsstock
Real number (ℝ≥0)

Distinct count7
Unique (%)2.4%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean1130.710683285964
Minimum801.4516129032259
Maximum1432.423076923077
Zeros0
Zeros (%)0.0%
Memory size2.4 KiB

Quantile statistics

Minimum801.4516129
5-th percentile801.4516129
Q1955.2857143
median1055.258065
Q31378.633333
95-th percentile1432.423077
Maximum1432.423077
Range630.971464
Interquartile range (IQR)423.347619

Descriptive statistics

Standard deviation222.9857939
Coefficient of variation (CV)0.1972085319
Kurtosis-1.491811678
Mean1130.710683
Median Absolute Deviation (MAD)206.2324389
Skewness0.04542798926
Sum334690.3623
Variance49722.66427
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[ 801.4516129 1020.58903226 1340.26494253 1405.52820513 1432.42307692], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
1378.633333 43 14.5%
 
1055.258065 43 14.5%
 
1432.423077 42 14.2%
 
801.4516129 42 14.2%
 
985.92 42 14.2%
 
1301.896552 42 14.2%
 
955.2857143 42 14.2%
 
ValueCountFrequency (%) 
801.4516129 42 14.2%
 
955.2857143 42 14.2%
 
985.92 42 14.2%
 
1055.258065 43 14.5%
 
1301.896552 42 14.2%
 
ValueCountFrequency (%) 
1432.423077 42 14.2%
 
1378.633333 43 14.5%
 
1301.896552 42 14.2%
 
1055.258065 43 14.5%
 
985.92 42 14.2%
 

roll4wd_udsprevisionempresa
Real number (ℝ≥0)

MISSING
ZEROS
Distinct count230
Unique (%)97.5%
Missing60
Missing (%)20.3%
Infinite0
Infinite (%)0.0%
Mean2862.405054479419
Minimum0.0
Maximum17426.0
Zeros5
Zeros (%)1.7%
Memory size2.4 KiB

Quantile statistics

Minimum0
5-th percentile196.75
Q11270.642857
median2367.875
Q33471.46875
95-th percentile7589.09375
Maximum17426
Range17426
Interquartile range (IQR)2200.825893

Descriptive statistics

Standard deviation2497.345349
Coefficient of variation (CV)0.8724639951
Kurtosis7.670456713
Mean2862.405054
Median Absolute Deviation (MAD)1702.768456
Skewness2.268021193
Sum675527.5929
Variance6236733.794
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
0 5 1.7%
 
206 2 0.7%
 
123 2 0.7%
 
2832.875 1 0.3%
 
2623 1 0.3%
 
1371.125 1 0.3%
 
13794.5 1 0.3%
 
951.25 1 0.3%
 
1868.25 1 0.3%
 
2588.125 1 0.3%
 
Other values (220) 220 74.3%
 
(Missing) 60 20.3%
 
ValueCountFrequency (%) 
0 5 1.7%
 
62 1 0.3%
 
98 1 0.3%
 
111.25 1 0.3%
 
123 2 0.7%
 
ValueCountFrequency (%) 
17426 1 0.3%
 
13794.5 1 0.3%
 
13665 1 0.3%
 
11310 1 0.3%
 
11131.75 1 0.3%
 

meanwd_udsprevisionempresa
Real number (ℝ≥0)

HIGH CORRELATION
Distinct count7
Unique (%)2.4%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean2301.6070133419885
Minimum206.0
Maximum4904.307692307692
Zeros0
Zeros (%)0.0%
Memory size2.4 KiB

Quantile statistics

Minimum206
5-th percentile206
Q1692.7692308
median2219.675676
Q33566.384615
95-th percentile4904.307692
Maximum4904.307692
Range4698.307692
Interquartile range (IQR)2873.615385

Descriptive statistics

Standard deviation1496.422306
Coefficient of variation (CV)0.6501641232
Kurtosis-0.8705206805
Mean2301.607013
Median Absolute Deviation (MAD)1204.94844
Skewness0.2685388148
Sum681275.6759
Variance2239279.717
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[ 206. 449.38461538 2030.82501733 2430.79836415 4904.30769231], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
2641.921053 43 14.5%
 
3566.384615 43 14.5%
 
4904.307692 42 14.2%
 
2219.675676 42 14.2%
 
692.7692308 42 14.2%
 
1841.974359 42 14.2%
 
206 42 14.2%
 
ValueCountFrequency (%) 
206 42 14.2%
 
692.7692308 42 14.2%
 
1841.974359 42 14.2%
 
2219.675676 42 14.2%
 
2641.921053 43 14.5%
 
ValueCountFrequency (%) 
4904.307692 42 14.2%
 
3566.384615 43 14.5%
 
2641.921053 43 14.5%
 
2219.675676 42 14.2%
 
1841.974359 42 14.2%
 

Interactions

Correlations

Pearson's r

Pearson's correlation coefficient (r) is a measure of linear correlation between two variables. Its value lies between -1 and +1: -1 indicates total negative linear correlation, 0 indicates no linear correlation, and +1 indicates total positive linear correlation. Furthermore, r is invariant under separate changes in location and (positive) scale of the two variables, so for an exactly linear relationship the steepness of the line does not affect r; only the sign of the slope does.

To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.
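
The calculation just described can be sketched in plain Python. This is an illustrative implementation of the textbook formula, not the code the report itself runs; pearson_r is a hypothetical helper name, and the sample figures are the first four udsventa and udsprevisionempresa values from the sample rows.

```python
import math

def pearson_r(x, y):
    """Covariance of x and y divided by the product of their standard deviations."""
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    # Sums of cross-products and squares; the 1/(n - 1) factors of the
    # covariance and the two standard deviations cancel in the ratio.
    cross = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    ss_x = sum((a - mean_x) ** 2 for a in x)
    ss_y = sum((b - mean_y) ** 2 for b in y)
    return cross / math.sqrt(ss_x * ss_y)

ventas = [738.0, 944.0, 836.0, 295.0]
prevision = [11310.0, 17426.0, 13665.0, 2876.0]
print(pearson_r(ventas, prevision))

# Invariance under location and positive scale changes of one variable.
rescaled = [2 * v + 5 for v in ventas]
print(math.isclose(pearson_r(rescaled, prevision), pearson_r(ventas, prevision)))
```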

Spearman's ρ

Spearman's rank correlation coefficient (ρ) is a measure of monotonic correlation between two variables, and is therefore better at catching nonlinear monotonic correlations than Pearson's r. Its value lies between -1 and +1: -1 indicates total negative monotonic correlation, 0 indicates no monotonic correlation, and +1 indicates total positive monotonic correlation.

To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.
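
A minimal sketch of this rank-based calculation follows. It assumes tied values receive the average of the ranks they span, a common convention that the text above does not specify; average_ranks, pearson_r and spearman_rho are hypothetical helper names.

```python
import math

def average_ranks(values):
    """1-based ranks; tied values share the average of the ranks they span."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    start = 0
    while start < len(order):
        end = start
        # Extend the run while the next sorted value ties with the current one.
        while end + 1 < len(order) and values[order[end + 1]] == values[order[start]]:
            end += 1
        shared_rank = (start + end) / 2 + 1
        for k in range(start, end + 1):
            ranks[order[k]] = shared_rank
        start = end + 1
    return ranks

def pearson_r(x, y):
    """Covariance divided by the product of the standard deviations."""
    mean_x = sum(x) / len(x)
    mean_y = sum(y) / len(y)
    cross = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    ss_x = sum((a - mean_x) ** 2 for a in x)
    ss_y = sum((b - mean_y) ** 2 for b in y)
    return cross / math.sqrt(ss_x * ss_y)

def spearman_rho(x, y):
    """Pearson's r applied to the rank variables of x and y."""
    return pearson_r(average_ranks(x), average_ranks(y))

# A monotonic but nonlinear relationship: rho is 1 although r is below 1.
x = [1, 2, 3, 4, 5]
y = [v ** 3 for v in x]
print(spearman_rho(x, y), pearson_r(x, y))
```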

Kendall's τ

Similarly to Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. Its value lies between -1 and +1: -1 indicates total negative correlation, 0 indicates no correlation, and +1 indicates total positive correlation.

To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is given by the number of concordant pairs minus the number of discordant pairs, divided by the total number of pairs.
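
The pair-counting definition above corresponds to the variant usually called tau-a, sketched below in plain Python. kendall_tau_a is a hypothetical helper name. Statistical libraries often report the tie-adjusted tau-b variant instead, so their values may differ when the data contain ties.

```python
from itertools import combinations

def kendall_tau_a(x, y):
    """(concordant - discordant) / total number of pairs, with no tie adjustment."""
    concordant = 0
    discordant = 0
    for (x1, y1), (x2, y2) in combinations(zip(x, y), 2):
        direction = (x1 - x2) * (y1 - y2)
        if direction > 0:
            concordant += 1   # both variables move in the same direction
        elif direction < 0:
            discordant += 1   # the variables move in opposite directions
        # direction == 0 means a tie in x or y; the pair counts for neither side
    n = len(x)
    total_pairs = n * (n - 1) // 2
    return (concordant - discordant) / total_pairs

print(kendall_tau_a([1, 2, 3, 4], [1, 2, 3, 4]))
print(kendall_tau_a([1, 2, 3], [1, 3, 2]))
```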

Missing values

Sample

First rows

row,df_index,fecha,producto,udsstock,udsventa,udsprevisionempresa,promo,festivo,weekday,quarter,month,weekofyear,working_day,sin_weekday,cos_weekday,is_august,spring,summer,autumn,winter,stockMissingType,roll4wd_udsventa,meanwd_udsventa,roll4wd_udsstock,meanwd_udsstock,roll4wd_udsprevisionempresa,meanwd_udsprevisionempresa
0,21,2019-06-05,30,1266.0,738.0,11310.0,0.0,0.0,2,2,6,23,True,0.974928,-0.222521,0,0,1,0,0,0.0,738.00,560.473684,1266.0,1055.258065,11310.00,2641.921053
1,93,2019-06-06,30,NaN,944.0,17426.0,0.0,0.0,3,2,6,23,True,0.433884,-0.900969,0,0,1,0,0,2.0,944.00,704.375000,NaN,1378.633333,17426.00,3566.384615
2,165,2019-06-07,30,NaN,836.0,13665.0,0.0,0.0,4,2,6,23,True,-0.433884,-0.900969,0,0,1,0,0,2.0,836.00,811.871795,NaN,1301.896552,13665.00,4904.307692
3,237,2019-06-08,30,NaN,295.0,2876.0,0.0,0.0,5,2,6,23,True,-0.974928,-0.222521,0,0,1,0,0,2.0,295.00,397.236842,NaN,1432.423077,2876.00,692.769231
4,309,2019-06-09,30,NaN,NaN,NaN,0.0,0.0,6,2,6,23,False,-0.781831,0.623490,0,0,1,0,0,2.0,NaN,NaN,NaN,955.285714,NaN,206.000000
5,381,2019-06-10,30,NaN,511.0,5371.0,0.0,0.0,0,2,6,24,True,0.000000,1.000000,0,0,1,0,0,2.0,511.00,575.837838,NaN,985.920000,5371.00,2219.675676
6,453,2019-06-11,30,849.0,541.0,3684.0,0.0,0.0,1,2,6,24,True,0.781831,0.623490,0,0,1,0,0,0.0,541.00,461.575000,849.0,801.451613,3684.00,1841.974359
7,525,2019-06-12,30,1508.0,492.0,1661.0,0.0,0.0,2,2,6,24,True,0.974928,-0.222521,0,0,1,0,0,0.0,676.50,560.473684,1326.5,1055.258065,8897.75,2641.921053
8,597,2019-06-13,30,1938.0,698.0,2900.0,0.0,0.0,3,2,6,24,True,0.433884,-0.900969,0,0,1,0,0,0.0,882.50,704.375000,1938.0,1378.633333,13794.50,3566.384615
9,669,2019-06-14,30,1356.0,1033.0,3532.0,0.0,0.0,4,2,6,24,True,-0.433884,-0.900969,0,0,1,0,0,0.0,885.25,811.871795,1356.0,1301.896552,11131.75,4904.307692

Last rows

row,df_index,fecha,producto,udsstock,udsventa,udsprevisionempresa,promo,festivo,weekday,quarter,month,weekofyear,working_day,sin_weekday,cos_weekday,is_august,spring,summer,autumn,winter,stockMissingType,roll4wd_udsventa,meanwd_udsventa,roll4wd_udsstock,meanwd_udsstock,roll4wd_udsprevisionempresa,meanwd_udsprevisionempresa
286,20613,2020-03-17,30,39.0,1741.0,1375.0,0.0,0.0,1,1,3,12,True,0.781831,0.623490,0,0,0,0,1,0.0,545.750000,461.575000,395.400000,801.451613,2486.000,1841.974359
287,20685,2020-03-18,30,NaN,551.0,2265.0,0.0,0.0,2,1,3,12,True,0.974928,-0.222521,0,0,0,0,1,2.0,508.500000,560.473684,465.000000,1055.258065,3413.625,2641.921053
288,20757,2020-03-19,30,762.0,1407.0,3526.0,0.0,0.0,3,1,3,12,True,0.433884,-0.900969,0,0,0,0,1,0.0,621.750000,704.375000,1153.000000,1378.633333,4447.250,3566.384615
289,20829,2020-03-20,30,1511.0,1938.0,3892.0,0.0,0.0,4,1,3,12,True,-0.433884,-0.900969,0,0,0,0,1,0.0,1293.250000,811.871795,1295.000000,1301.896552,8129.125,4904.307692
290,20901,2020-03-21,30,2275.0,246.0,NaN,0.0,0.0,5,1,3,12,True,-0.974928,-0.222521,0,0,0,0,1,0.0,765.625000,397.236842,1254.500000,1432.423077,NaN,692.769231
291,20973,2020-03-22,30,1410.0,NaN,NaN,0.0,0.0,6,1,3,12,False,-0.781831,0.623490,0,1,0,0,1,0.0,NaN,NaN,1410.000000,955.285714,NaN,206.000000
292,21045,2020-03-23,30,1410.0,NaN,473.0,0.0,0.0,0,1,3,13,True,0.000000,1.000000,0,1,0,0,1,0.0,579.857143,575.837838,1410.000000,985.920000,2009.125,2219.675676
293,21117,2020-03-24,30,936.0,NaN,440.0,0.0,0.0,1,1,3,13,True,0.781831,0.623490,0,1,0,0,1,0.0,935.714286,461.575000,240.857143,801.451613,1839.875,1841.974359
294,21189,2020-03-25,30,1004.0,NaN,1153.0,0.0,0.0,2,1,3,13,True,0.974928,-0.222521,0,1,0,0,1,0.0,542.142857,560.473684,644.000000,1055.258065,2661.000,2641.921053
295,21261,2020-03-26,30,1515.0,NaN,797.0,0.0,0.0,3,1,3,13,True,0.433884,-0.900969,0,1,0,0,1,0.0,819.000000,704.375000,950.250000,1378.633333,3599.875,3566.384615